JUMP-DIFFUSION RISK-SENSITIVE ASSET MANAGEMENT 



MARK DAVIS* AND SEBASTIEN LLEQt 

Abstract. This paper considers a portfolio optimization problem in which asset prices are 
represented by SDEs driven by Brownian motion and a Poisson random measure, with drifts that 
are functions of an auxiliary diffusion factor process. The criterion, following earlier work by Bielecki, 
Pliska, Nagai and others, is risk-sensitive optimization (equivalent to maximizing the expected growth 
rate subject to a constraint on variance.) By using a change of measure technique introduced by 
Kuroda and Nagai we show that the problem reduces to solving a certain stochastic control problem 
in the factor process, which has no jumps. The main result of the paper is that the Hamilton-Jacobi- 
Bellman's equation for this problem has a classical (C^'^) solution. The proof uses Bellman's 'policy 
improvement' method together with results on linear parabolic PDEs due to Ladyzhenskaya et al. 

1. Introduction. Risk-sensitive control is a generalization of classical stochastic 
control in which the degree of risk aversion or risk tolerance of the optimizing agent is 
explicitly parameterized in the objective criterion and influences directly the outcome 
of the optimization. In risk-sensitive control, the decision maker's objective is to select 
a control policy h{t) to maximize the criterion 



J{x,t,h;e) :-- 
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where t is the time, x is the state variable, _F is a given reward function, and the 
risk sensitivity — 0[U]0, oo) is an exogenous parameter representing the decision 
maker's degree of risk aversion. A Taylor expansion of the previous expression around 
9 — evidences the vital role played by the risk sensitivity parameter: 



J(x, t, h-e)=^ [F{x, t, h)] - - Var [F{x, t, h)] 



0{9^) 



(1.2) 



This criterion amounts to maximizing E [F{x, t, h)] subject to a penalty for variance. 
Hence risk-sensitive control differs from traditional stochastic control in that it explic- 
itly models the risk-aversion of the decision maker as an integral part of the control 
framework, rather than importing it in the problem via an externally defined utility 
function. For a general reference, see Whittle [34]. Much of the recent literature 
concerns the infinite time horizon problem: 
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This is interesting from a theoretical perspective, but is not applicable to practical 
asset management because of the non- uniqueness of controls. Optimality in this sense 
is a 'tail property': if h*{t) is optimal, then so is h{t) = h*{t)lt>T + h{t)lt<T for any 
arbitrary process h{t) and time T > 0. Of course, near-term decisions are of primary 
importance to investment managers. 

In the past decade, the applications of risk-sensitive control to asset management 
have flourished. Risk-sensitive control was first applied to solve financial problems 
by Lefebvre and Montulet [25 in a corporate finance context and by Fleming [T^] 
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in a portfolio selection context. However, Bielecki and Pliska U were the first to 
apply the continuous time risk-sensitive control as a practical tool that could be used 
to solve 'real world' portfolio selection problems. They considered a long-term asset 
allocation problem and proposed the logarithm of the investor's wealth as a reward 
function, so that the investor's objective is to maximize the risk-sensitive (log) re- 
turn of his/her portfolio or alternatively to maximize a function of the power utility 
(HARA) of terminal wealth. They derived te optimal control and solved the asso- 
ciated Hamilton- Jacobi-Bellman (HJB) PDE under the restrictive assumption that 
the asset and factor noise are uncorrelated. This assumption is unrealistic and it was 
later relaxed (see [7]). The contribution of Bielecki and Pliska to the field is immense: 
they studied the economic properties of the risk-sensitive asset management crite- 
rion (see 1^), extended the asset management model into an intertemporal CAPM 
([7]), worked on transaction costs ([S]), numerical methods (13 J and considered fac- 
tors driven by a CIR model ([5]). A major contribution was made by Kuroda and 
Nagai [23' who introduced an elegant solution method based on a change of measure 
argument which transforms the risk sensitive control problem in a linear exponen- 
tial of quadratic regulator. They solved the associated HJB PDE over a finite time 
horizon and then studied the properties of the ergodic HJB PDE. Recently, Davis 
and Lleo [TU] applied this change of measure technique to solve, at a finite and an 
infinte horizon, a benchmarked investment problem in which an investor selects an 
asset allocation to outperform a given financial benchmark. 

Risk-sensitive asset management theory was originally set in a world of diffu- 
sion dynamics where randomness is modelled using correlated Brownian motions. To 
our knowledge, the only attempt to extend the risk-sensitive asset management the- 
ory from a diffusion to a jump diffusion setting was made by Wan j33j who briefly 
sketched a jump-diffusion extension of Bielecki and Pliska's 0] original infinite hori- 
zon risk-sensitive asset management model. Wan's treatment is however restrictive 
as it only considers a single Poisson process-driven jump per asset and assumes that 
the underlying valuation factor risks and asset risks are uncorrelated. Our article ad- 
dresses these two limitations. The setting of our control problem, which takes place 
within a finite time horizon, allows for both infinite activity jumps in asset prices and 
for a correlation structure between factor risks and asset risks. To solve this con- 
trol problem we extend Kuroda and Nagai's powerful change of measure technique 
to account for the jumps. One of the difficulties we face in extending this technique 
is proving that the optimal control is admissible as this requires showing that the 
Doleans exponential (|2.8|) associated with this control is a martingale. In a pure dif- 
fusion setting, this would follow easily from the Kamazaki condition or the Novikov 
condition. However, when the Doleans exponential does not have continuous path, as 
is the case in a jump diffusion setting, proving that it is indeed a martingale is more 
difficult as only weaker partial results exist. 

The problem we consider is also related to the vast literature on HARA utility 
maximization. It is not our intension to present here methodical review of the nu- 
merous developments which have occurred in the last 40 years. We will keep our 
discussion brief and focused on references related to risk-sensitive control. In parallel 
with the Bielecki and Pliska, Fleming and Sheu [14] considered a classical optimal in- 
vestment problem in which the objective was to maximize the long-term growth rate 
of expected HARA utility. The authors showed that this problem could also be ex- 
pressed as an infinite time horizon risk sensitive control problem, and then estimated 
the solution for a range of policy constraints and HARA exponent values. Building 
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on their previous results and on Bielecki and Pliska [1], Fleming and Sheu [15 still 
considered the objective of maximizing the long-term growth rate of expected HARA 
utility, but allowed underlying factors to directly affect the securities process. They 
then reformulated the problem as an infinite time horizon risk sensitive control prob- 
lem and studied the associated dynamic programming equations. In Fleming and 
Sheu |16j, the authors applied results from risk sensitive control theory to derive an 
approximately optimal Markovian investment policy. Hansen and Sargent |19j , advo- 
cate the use of risk sensitive control to account for model risk in dynamic economic 
decision models. Their methods and objectives are quite different to those pursued 
here. 

In this paper the asset price processes are modelled as jump-diffusions whose 
growth rates are functions of an auxiliary 'factor' process X{t) which satisfies a lin- 
ear diffusion SDE. Our main results are that the risk-sensitive jump-diffusion asset 
management problem is equivalent to an optimal control problem for a diffusion pro- 
cess (no jumps) and that the HJB equation for the latter admits a unique classical 
C^'^ {[0,T) X R") solution. Showing the existence and uniqueness of a solution to 
a risk-sensitive control problem can prove difficult even in a pure diffusion setting. 
For example, Bensousan, Freshe and Nagai [2] had to constrain the behaviour of the 
Hamiltonian in order to prove existence of a classical solution. Still in a pure diffusion 
setting, Fleming and Soner (see V.9 in [17]) proved that the value function is a contin- 
uous viscosity solution of the associated Hamilton Jacobi Bellman Partial Differential 
Equation (HJB PDF) but had to assume boundedness of all coefficients and of the 
derivatives of the reward function. No such strong condition is required to solve the 
jump diffusion problem considered in this article. In fact all our assumptions arise 
naturally from the structure of the risk-sensitive asset management problem. Unique- 
ness follows from a classical verification argument while the proof of existence relies 
on a policy improvement algorithm and on the properties of linear parabolic PDEs. 

The paper is organized as follows. We first introduce the general setting of the 
model in section 2 and define the class of random Poisson measures which will be 
used to model the jump component of the asset dynamics. In Section 3, we formulate 
the jump-diffusion control problem and introduce the change of measure argument 
of Kuroda and Nagai [23]. In a pure diffusion case, this is enough to transform the 
problem into a standard Linear Exponential of Quadratic Regulator. In our jump- 
diffusion setting, the change of measure simplifies the problem by associating the 
HJB PDF given in Section [3?3l rather than the expected Partial Integro-Differential 
Equation containing non-local terms, to the value function. It is striking that an 
optimal control problem for a jump-diffusion model has a solution that is characterized 
in terms of a HJB PDE and not a HJB PIDeS. 

In Section 4 we address two key questions. First, the admissibility of the optimal 
control is no longer guaranteed because the Doleans exponential defining the Radon- 
Nikodym does not have continuous path. This point is addressed in Propositions 
4.3 and 4.4. Second, the Risk- Sensitive Hamilton- Jacobi-Bellman Partial Differential 
Equation (RS HJB PDE) contains a jump-induced control-dependent integral term: 
it is no longer possible to find an analytical solution and the existence of a strong, 
classical solution is no longer guaranteed. However, should we be able to prove the 
existence of a classical C^'^ solution to the RS HJB PDE, then we can prove unique- 
ness and resolve the control problem using a straightforward verification theorem, 
presented in Theorem 4.1 and Corollary 4.2 in Section 4. 

^See 0ksendal and Sulem I22| for a treatment of jump-diffusion control problems. 
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The main remaining difficulty is proving the existence of a strong sohition to the 
RS HJB PDE. The contribution of this article is to show, in Theorem 5.2 and Corol- 
lary 5.4, that the risk-sensitive jump-diffusion control problem we consider admits a 
unique classical C^'^ ([0, T) x M") solution. Showing the existence and uniqueness of a 
solution to a risk-sensitive control problem can prove difficult even in a pure diffusion 
setting. For example, Bensoussan, Frehse and Nagai ^ had to constrain the behaviour 
of the Hamiltonian in their finite time horizon problem to prove existence of a classical 
solution. Still in a pure diffusion setting and over a finite time horizon, Fleming and 
Soner (see V.9 in [T^) proved that the value function is a continuous viscosity solu- 
tion of the associated Hamilton Jacobi Bellman Partial Differential Equation (HJB 
PDE) but had to assume boundedness of all coefficients and of the derivatives of the 
reward function. No such strong condition is required to solve the jump diffusion 
problem considered in this article. In fact all our assumptions arise naturally from 
the structure of the risk-sensitive asset management problem. We obtain our result 
by applying an approximation in policy space in a two-step process: first, we show 
existence on a bounded region and then extend to the unbounded state space. In 
Section 6, we present a summary of our results in Theorem 6.1 and Corollary 6.3 
to conclude our investigation of the jump diffusion risk-sensitive asset management 
problem. 

Up to this point, we have assumed that the factor process X{t) is directly ob- 
served by the controller, and therefore represents real economic factors: GDP growth, 
inflation, the S&P500 index, etc. We may however wish to use X{t) as an abstract 
latent factor, introduced to model volatility of returns, in which case only the prices, 
and not X{t)^ will be observed. In our final Section 7, we note that this problem, once 
adequately reformulated, can be solved using a classical Kalman filter, as in [28], as 
the jump noise is absent from the dynamics of X{t). While this is from a technical 
point of view a simple observation, it greatly enhances the applicability of our results. 

2. Analytical Setting. 

2.1. Overview. The growth rates of the assets are assumed to depend on n fac- 
tors Xi(t), . . . , Xn{t) which follow the dynamics given in equation (j2.2|) below. As in 
Kuroda and Nagai's asset-only model, the assets market comprises m risky securities 
Si, z = 1, ... TO. In contrast to Kuroda and Nagai, we assume that the money market 
account process, 5*0, is an affine function of the valuation factors, which enables us to 
easily model a stochastic short term rate. Let M := n + m. 

Let (ri, {J-^t} IP) be the underlying probability space. On this space is defined 
an R^^-valued (J"t)-Brownian motion W{t) with components Wk{t), k — 1,...,M. 
Moreover, let (Z,Bz) be a Borel spacfU. Let p be an (J"t)-adapted cr-finite Poisson 
point process on Z whose underlying point functions are maps from a countable set 
Dp C (0, cxd) into Z. Define 

3p := {U e B{Z), E [Np{t, U)] < oo Vt} (2.1) 

where Np{dt, dz) is the Poisson random measure on (0, oo) x Z induced by p. 



is a Polish space and Sz is the Borel cr-field 
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Our analysis will focus on stationary Poisson point processes of class (QL) with 
associated Poisson random measure Np{dt,dz). The class (QL) is defined in [50] 
(Definition II. 3.1 p. 59) as 

Definition 2.1. An {Ft)-adapted point process p on (57,7^, P) is said to be of 

class (QL) with respect to (J-t) if it is a -finite and there exists Np — (^Np{t,U)^ such 
that 

(i.) for U e 3j3, t ^ Np{t, U) is a continuous {J-t)-adapted increasing process; 

(a.) for each t and a. a. uj £ fl, U Np{t, U) is a a-finite measure on (Z,B(Z)); 

(Hi.) for U G 3p, t M> Np(t, U) = Np{t, U) — Np(t, U) is an {J-t) -martingale; 
The random measure |7Vp(i, C/)| is called the compensator of the point process p. 
Because the Poisson point processes we consider are stationary, then their compen- 
sators are of the form Np{t, U) = i'{U)t where u is the cr-finite characteristic measure 
of the Poisson point process p. 

Finally, for notational convenience, we fix throughout the paper a set Zq G Bz 
such that i^(Z\Zo) < oo and define the Poisson random measure Np{dt,dz) as 

Np{dt,dz) 

__ ( Np{dt, dz) - Np{dt, dz) = Np{dt, dz) - v{dz)dt Np{dt, dz) \i z^Z^ 
~ \ Np{dt,dz) if z G Z\Zo 

2.2. Factor Dynamics. The dynamics of the n factors can be expressed through 
the affine diffusion equation 

dX{t) ^ {b + BX{t))dt + AdW{t), X{0)^x (2.2) 

where X{t) is the R"-valued factor process with components Xj{t) and b G M", 
B G M"^" and AM"^^^. 



2.3. Asset Market Dynamics. Let 5*0 denote the wealth invested in the money 
market account with dynamics given by the equation: 

= (ao + A'aX{t)) dt, 5o(0) = So (2.3) 

where ag G M is a scalar constant, G M" is a n-element column vector and through- 
out the paper x' denotes the transpose of the matrix or vector x. 

Let Si{t) denote the price at time t of the ith security, with i = 1, . . . , m. The 
dynamics of risky security i can be expressed as: 

^0- = {a + AXit)),dt + y2a,kdWkit)+ [ -f,{z)Np{dt,dz), 

S,{0) = s„ i = l,...,m (2.4) 

where a G M™, A G K'"^", S := [a,.j] , i = 1, . . . , m, j = 1, . . . , Af and 7(2) G K™ 
satisfying Assumption [TJ 

Assumption 1. 7(2) g M™ satisfies 

- 1 < 7,""'" < 7^(2) < <+oo, t = l,...,m 



and 



1 < 7r'" < < iT"" < +00, 



i — 1, . . . ,m 



for i — 1, . . . ,m. Furthermore, define 



S := su'pp(v) e Bz 



and 



where supp{-) denotes the measure's support, then we assume that 11^=1 [tI"*" ' Tf'"^] 
is the smallest closed hypercube containing S. 

In addition, the vector-valued function 7(2) satisfies: 



Note that Assumption [T] requires that each asset has, with positive probabihty, 
both upward and downward jump. As will become evident in Section |3.2[ the effect 
of this assumption is to bound the space of controls. Relation (|2.5I) is a standard 
condition. See Definition II. 4.1 in Ikeda and Watanabe [50] H. 

Define the set J' as 



For a given z, the equation h'j{z) = — 1 describes a hyperplane in R™. is a convex 
subset of R" for aU {t,x) e [0,T] x R". 

2.4. Portfolio Dynamics. In this paper, we will limit ourselves to the case 
9 > 0, which implies that the investor has positive risk aversion. In our analysis, we 
make the following assumption: 

Assumption 2. 



The effect of Assumption [2] is to prevent redundant assets. For example, we will 
not able to model in our investment market a share and an option or futures on that 
share. However, this assumption leaves us free to model a wide range of assets such 
as shares, bonds and commodities products as well as related indexes. 

Let Gt ■= (t{{S{s),X{s)),0 < s < t) he the sigma-field generated by the security 
and factor processes up to time t. 




(2.5) 




(2.6) 



EE' > 



^In |20| . Fp and F^'°'^ are respectively given in equations 11(3.2) and 11(3.5) 
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An investment strategy or control process is an R^-valued process with the in- 
terpretation that hi{t) is the fraction of current portfoho vahie invested in the ith 
asset, i — l,...,m. The fraction invested in the money market account is then 

ho{t) = i~j::uh,it). 

Definition 2.2. An W"" -valued control process h(t) is in class % if the following 
conditions are satisfied: 

1. h{t) is progressively measurable with respect to {B([0, t]) 0t}t>o '"^'^ '■s cddldg; 

2. |/i(s)|^ ds < =1, VT > 0; 

3. h'{t)-f{z) > -1, Vt > 0, z e Z, a.s. dv. 
Define the set /C as 

IC:^{hen:heJ for a.a.t} (2.7) 

Lemma 2.3. Under Assumption]^ a control process h{t) satisfying condition 3 
in Definition \2.2\ is bounded. 

Proof. The proof of this result is immediate. □ 

Definition 2.4. A control process h{t) is in class A{T) if the following conditions 
are satisfied: 

1. hen; 

2. ^Xt ~ ^ where Xt, t G (0,T], is the Doleans exponential defined as 



expl^-^ej^ h{s)'mWs~-]^0^ h{s)'YJ:'h{s)ds 
hi (1 - G{z, h{s); 6)) Np{ds, dz) 



10 JX 

[ / {In (1 - G{z, h{s); 6)) + G{z, h{s);e)} v{dz)ds 
Jo Jz 



and 

Giz,h;e) ^1- {l + h'-f{z))~'^ (2.9) 

Definition 2.5. We say that a control process h{t) is admissible ifh{t) S A{T). 

The proportion invested in the money market account is hgit) = 1 — X^IIli ^ii^)- 
Taking this budget equation into consideration, the wealth, V{t) of the investor in 
response to an investment strategy hit) e H, follows the dynamics 

- (ao + AoX(i)) dt + h'{t) (a - oqI + {A- IA'q) X(t)) dt 



v{t-) 

+h' {t)-i:dWt + I h'{t)-f{z)Np{dt,dz) 



where 1 G denotes the m-element unit column vector and with 1^(0) = v. Defining 
a :— a ~ oqI and A :— A ~ l^o, we can express the portfolio dynamics as 

= (ao + AgXit)) dt + h'(t) (a + AX{t)) dt + h'{t)j:dWt + [ h' {t)-f{z)Np{dt, dz) 
^[t ) ^ ^ Jz 

(2.10) 

with initial endowment 1^(0) = 0. 
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3. Problem Setup. 

3.1. Optimization Criterion. We will assume that the objective of the in- 
vestor is to maximize the risk adjusted growth of his/her portfolio of assets over a 
finite time horizon. In this context, the objective of the risk-sensitive management 
problem is to find h* (t) G -4(T) that maximizes the control criterion 



J{x,t,h;e) := -^InE 



^-einV{t,xJi) 



(3.1) 



Hence, 



where 



By Ito, the log of the portfolio value in response to a strategy h is 
\nV{t)=\nv+ f {ao + A[fX{s)) + h{sy(^a + AX{s)^ds-^f /i(s)'SS'/i(s)(is 



h{sYJ:dWis) 
t p 

{In (1 + h{sy-f(z)) - hisy-f{z)} v{dz)ds 
\n{l + h{sy-^{z))Np{ds,dz) 



J7.0 

t 



Jz 



g{Xs,h[sye)ds\ xt 



(3.2) 



(3.3) 



g{x, h;9)^ -{9 + 1) - oo - A'qX - h'{a + Ax) 



1 r 



{l + h'jiz))-" - 
and the Doleans exponential Xt is given by (|2.8 



+ h'j{z)lz„{z)}i^{dz) (3.4) 



3.2. Change of Measure. Let be the measure on (51, J^t) defined via the 
Radon-Nikodym derivative 

For a change of measure to be possible, we must ensure that the following technical 
condition holds: 

G{z,h{s);9) < 1 

This condition is satisfied iff 

h'{s)-f{z) > -1 (3.6) 

a.s. dv, which was already required for h{t) to be in class TL (Condition 3 in Defini- 
tion l2.2[) . Condition (|3.6p is endogenous to the control problem and can be interpreted 



as a risk management safeguard preventing the investor from investing in some of the 
portfohos if the jump component of these portfohos could result in the investor's 
bankruptcy. One could contrast this change of measure condition with the introduc- 
tion of stopping time to track bankruptcy time and define the solvency region in the 
Merton model (see For example Mertom [?]), since both are related to the risk that 
the investor might go bankrupt. Two main differences exist between these approaches. 
First, in the jump diffusion version of the Merton model, bankruptcy could arise from 
either large enough jumps in the asset prices or large enough diffusive change in price. 
However, in our JDRSAM model the only movement which could potentially result 
in bankruptcy arises from jumps. This is due to the geometric nature of the model 
which prevents bankruptcy in the pure diffusion case. Second, in the Merton model, 
the solvency constraint is exogenous to the control problem: it has to be imposed 
through the introduction of a stopping time. On the other hand, in our JDRSAM 
model the constraint is endogenous to the control problem: it arises naturally from 
the change in measure. 

Observe that is a probability measure for h G A{T). 

For h e A{T), 

Wl" + 9 [ S'/i(s)ds 

Jo 

is a standard Brownian motion under the measure and we have 

f f N^{ds,dz)= f f Np{ds,dz)- f f {l~G{z,h{s);e)}iy{dz)ds 
Jo Jz Jo Jz Jo Jz 

^ lo I ^P^^''^'^^^ ^ J {i'^ + h'liz)r'^}''idz)ds 
As a result, X(t) satisfies the SDE: 

dX{t) = {b + BX{t)-9AY.'h{t))dt + AdWl}, te[0,T] (3.7) 

We will now introduce the following two auxiliary criterion functions under the 
measure P^: 

• the auxiliary function directly associated with the risk-sensitive control prob- 
lem: 

Iiv,x;h;t,T;9)^-^ln-E'^; 

where Et^x [■] denotes the expectation taken with respect to the measure P^ 
and with initial conditions (i, x). 

• the exponentially transformed criterion 

i{v,x,h;t,T;e) := E^;, 
which we will find convenient to use in our derivations. 

We have completed our reformulation of the problem under the measure P^. The 
state dynamics (j3.7l) is a diffusion and our objective is to maximize the criterion p.8p 
or alternatively minimize (13. 9p . 
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exp < 9 



l{X„h{s);9)ds -e\nv 



(3.8) 



exp 




9lnv 



(3.9) 



3.3. The Risk-Sensitive Control Problems under P^. Let $ be the value 
function for the auxihary criterion function I(v,x; h;t,T). Then $ is defined as 

$(t,a;)= sup I{v,x;h;t,T) (3.10) 

heA(T) 

We will show that $ satisfies the HJB PDE 

— (t,a;) + supLj'$(t,x) =0, (t, a;) £ (0, T) x M" (3.11) 
ot hej 



where 



L'mt, x)^{b + Bx- 9AT.'h{s))' D$ 



+ itr (AA'L>2$) - ^(i:)$)'AA'D$ - g{x, h; 9) (3.12) 

and subject to terminal condition 

$(r,x)=lnw, xeM" (3.13) 



Similarly, let $ be the value function for the auxiliary criterion function /(w, x; /i; T). 
Then $ is defined as 

l>(i,x)= inf i{v,x;h;t,T) (3.14) 



"(t, a;) + itr (AA'i:>2$(t, x)^ + H(t, x, $, £>!>) = (3.15) 



The corresponding HJB PDE is 
'dt 

and subject to terminal condition 

$(r,a;) = u"^ (3.16) 

and where 

H{s,x,r,p) = m^^^^{b + Bx - OAY.' h{s))' p + eg{x, h; 9)r^ (3.17) 

for r e M, p e M" and in particular, 

$(t,a;) = exp{-6l$(i,x)} (3.18) 

Note that since $ and $ are related through a strictly monotone continuous 
transformation, an admissible (optimal) strategy for the exponentially transformed 
problem is also admissible (optimal) for the risk-sensitive problem. 

The supremum in (I3.11|) can be expressed as 
sup 

1 9 
^{b + Bx)' £>$ + -tr {AA'D^<^>) - -(£>$)' AA'D* + ao + A'^x 

+ sup \~-{9+ 1) /I'EE'/i - 9h'j:A'D<i> + h'{a + Ax) 

~l /z { [^^ + '^'^(^))"' - l] + ^/i'7(^)Jz„(^)} Krf^)} (3.19) 
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Under Assumption [21 for any p e R" the terms 



- (61+ - eh'Uk'p 



h'( 



Ax) 



h'j{z)lz„iz)iy{dz) 



and 



I J^{[i^ + h'l{z)r''-l]}Hdz) 



are both strictly concave in /i Vz G Z a.s. di'. Therefore, the supremum is reached for 
a unique maximizer h(t,x,p), which is an interior point of the set J7 defined in equa- 
tion (j2.6p . and the supremum, evaluated at h(t,x,p) S M", is finite. By measurable 
selection, h can be taken as a Borel measurable function on [0,T] x M" x M". 



4. Verification Theorems. In this section, we prove a verification theorem to 
the effect that if (|3.f f p has a C^'^ solution then that solution is equal to $ defined 
by (13.1 op and the control h*{t) = h{t, x, 1?$) is optimal. We will first prove a verifica- 
tion theorem for the exponentially transformed problem p.l4p with HJB PDE ()3.15p 
and value function $(i,a;). As a corollary, we will obtain a verification theorem for 
the risk sensitive control problem with (|3.10p . HJB PDE p.Iip and value function 
^{t,x). Define the second order operator 

L'^ifit, x) = {b + Bx- 9AJ:'h{s))' Dip{t, x) + eg{x, h- 9)(p{t, x) (4.1) 



Theorem 4.1 (Verification Theorem for the Exponentially Transformed Control 
Problem). Let ^ be a C^'^ ([o,r] x M") bounded function, 
(i) Assume that 4>{T,x) < g-'^i"^ Va; e M" and 

^{t, x) + ^tr {KK'D'^4>{t, a;)) + H{t, x, 4>, D4>) > 

on [0,T] X M", then 4>{t,x) < ^{t,x) y{t,x) £ [0,T] x R" 
(ii) Further assume that 4>{T,x) = e-"'"" Vx G M" and there exists a Borel- 
measurable minimizer h*{t,x) of h L'^cj) defined in (j4.ip such that 

^{t, x) + ^tr [kk'D^{t, x)) + Hit, X, 4>, D4>) 

= ^ (t, ^) + ^ tr [aA' D^{t, x)) + L^' ^ 
= 

and the stochastic differential equation 

dX{t) = {b + BX{t) - eAT.'h{t)) dt + AdW^ 

defines a unique solution X{s) for each given initial data Xt — x and the process 
7r*(s) :— h*{s, X(s)) is a well-defined control process in A(T). Then (j) — ^ and 
7r*(s) is an optimal Markov control process. 
Proof. The following proof is based on an argument used by Touzi [32] . 
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(i). Let h e A{T) be an arbitrary control, with X{t) the state process with initial 
data X{t) = X. Define the stopping time 

rjv := T A inf {s > t : \Xs - x\ > N} 

Define Z{s) = 9 g{s, Xg, hg] 9)ds, then 



d{e^') ■.= eg{s,XsX\0)e^' 



Also, by Ito, for s e \t, r^] , 



where £ is the generator of the state process X{t) defined as: 

/:4>{t, x) := {b + Bx- 6lAS'/i(s))' D4> + ^tr (aA'D^^) 



By the Ito product rule, and since dZ^ • 0^ = 0, we get 
d (0se^=) = ^sd {e^') + e^^d^s 
and hence for s e [i, tjv] 

4){s, Xg)e^^ = 4>{t, x)e^' +e 4){u, Xu)g{u, X„, K; e)e^"du 

(^^{u, Xu) + C^{u, X„)e^" j '^'^ + / D4>'AdW: 
Because for an arbitrary control h, 

^{t,x) + ^tr (^AA'D^4>{t,x)) + d'4>{t,Xt) 

- + (AA'D20(i,x)) +Hit,x,4>,D^) 

> 

and e^' > Vs G [t, tn] we have 

4>{t,x)e^' < 4>{s,Xs)e^^ + D4)'AdWi 
Taking the expectation, wo obtain 



12 



In particular, take s = tm and note that e^* = 1, then 



(TAr,XT-„)e 



61/7™ g{u,X^,h^;9)du 



Since ^ is assumed to be bounded, there exists a constant Ci > such that: 



Since for an arbitrary admissible control h G •A.{T) and fixed s G [t, T] there 
exists some constant C2 > such that 



g{s,X,,K;e) <C2\1 + X{s) 



Then 



for C3 = Cie<^= and C4 = Cse^C^"*). 

By the dominated convergence theorem and the assumption that (j){T, Xt) < 

„-9lnv 



< E 



Wc have now proved the first part of the theorem, 
(ii). To prove the second part, we can simply apply the same reasoning for the 
optimal control h* . Note, however, that with this choice of control we would 
have 

'dt 



{t, x) + itr (^AA' D^4>{t, x)) + H{t, x, 4>, D^) 
^ (i , x) + itr [aA' D^{t, x)) + i}' ^ 



= 

which would lead us to equality in the last equation, i.e. 

~^{t,x) = Eff Le g(u,X^M)du _Ql^y 



Corollary 4.2 (Verification Theorem for the Risk-Sensitive Control Problem). 
Let (I) be a C^'^ ([0, T] x M") n C ([0, T] x M") bounded Junction. 
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(i) Assume that (j){T,x) < e^^^"" Va; G R" and 

^ + sup Ll(f>{t,X(t))>0 
ot h&J 

on [0,T] X R", then (f){t,x) < ^{t,x) V(t,x) e [0,7] x R" 
(ii) Further assume that cj){T,x) = e"^'"'' Va; e R" and there exists a mimmizer 
h*{t,x) o/ /i H- > defined in (j3.12p such that 

^ + sup L'l<l>{t,X{t)) = ^+Lfcl>{t,X{t)) = 
at hf=j at 

and the stochastic differential equation 

dX{s) = {b + BX{s') - 9AJ:'h{s)) ds + AdW^ 

defines a unique solution X for each given initial data Xt = x and the process 
7r*(s) := h*{s,X{s)) is a well-defined control process in A{T). Then <j) = <^ and 
IT* (s) is an optimal Markov control process. 
Proof. This corollary follows from equation p.lSp and from the fact that an 
admissible (optimal) strategy for the exponentially transformed problem is also ad- 
missible (optimal) for the risk-sensitive problem. □ 

Proposition 4.3. The minimizer h*{t,x) of h y-^ L'^cj) defined in (|3.12p is 
admissible: h*{t) G A{T). 

Proof. Refer to Appendix |X] for a full discussion and a proof of this proposition. 

□ 



Applying Proposition |4]3] we deduce that the control h*{t) is optimal for the aux- 
iliary problems (|3.8|) and p.9|) resulting from the change of measure. However, this 
proposition is not sufficient to conclude that h*{t) is optimal for the original prob- 
lem (13.11) set under the P-measure. The next result show that this is indeed the case. 



Proposition 4.4. The optimal control h*(t) for the auxiliary problem sup^g_^(-y^ I{v, x] h; t, T; 
where I is defined in p.Sp is also optimal for the initial problem sup/jg_4(y) Jix, t, h; 9) 
where J is defined in p.ip . 

Proof. Refer to Appendix |A] for a proof. □ 



5. Existence of a Classical Solution. Historically, proving the existence of 
a strong, analytical solution to the HJB PDE was both the main difficulty and the 
main objective when solving a control problem. Fleming and Rishel |13) as well as 
Krylov [?] and [?] have been the main contributors, proposing techniques based ei- 
ther on PDE arguments or on probability theory. Recently however, the emphasis 
has switched from strong solutions to weaker types of solution. Viscosity solutions 
have proved particularly useful and successful, gaining many applications in stochas- 
tic control theory (see for example the classic article by Crandall, Ishii and Lions [?] 
as well as Fleming and Soner [17] for a tour applications ot stochastic control). The 
reason for this appeal is twofold. First, it is significantly easier to prove the existence 
of a viscosity solution than a classical solution. In the viscosity world, the difficulty 
is shifted from proof of existence to proof of uniqueness, and even then it is gener- 
ally easier to prove uniqueness of a viscosity solution via a comparison theorem than 
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the existence of a classical solution. Second, the stability result due to Barles and 
Souganidis [?] connects directly viscosity solutions to numerical methods, making it 
easy to solve 'real world' control problems. 

This section follows similar arguments to those developed by Fleming and Rishel [T5] 
(Theorem 6.2 and Appendix E). Namely, we use an approximation in policy space 
alongside results on linear parabolic partial differential equations to prove that the 
exponentially transformed value functions $ is of class C^'^((0,T) x R"). Then it 
follows that the value functions $ is also of class C^'^((0,T) x R"). The approxima- 
tion in policy space algorithm was originally proposed by Bellman in the 1950s (see 
Bellman [I] for details) as a numerical method to compute the value function. Our 
approach has two steps. First, we use the approximation in policy space algorithm 
to show existence of a classical solution in a bounded region. Then, we extend our 
argument to unbounded state space. To derive this second result we follow a different 
argument than Fleming and Rishel |13j which makes more use of the actual structure 
of the control problem. 

Our interest in classical solutions is as much mathematical as practical. First, 
since a smooth solution is a viscosity solution but the converse is not necessarily true, 
we are proving a stronger result. Second, this stronger result immediately translates 
a better grasp of the analytical properties of the value function. While viscosity 
solutions provide continuity, they do not generally give information about higher 
order derivatives. By contrast, classical solutions are smooth in the state, implying 
that they are (at least) in time and in the state. Third, viscosity solutions 
are purely about solving the PDF and although they show that the value function is 
the unique solution of the HJB PDF they do not prove directly the control problem 
has a solution, that is a pair of a value function and an admissible optimal control. 
Fourth and finally, in our case seeking a strong solution does not impair our search 
for numerical results. Because our state process X{t) can clearly be interpreted as the 
continuous time limit of a Markov Chain, we can apply well-known results by Kushner 
and Dupuis [?] to prove convergence of a finite approximation scheme to the value 
function. We can therefore solve concrete portfolio selection problems quite directly. 

5.1. "Zero Beta" Policies. In this section, we introduce a new class of control 
policies: the "zero beta" (0/3) policies: 

Definition 5.1 (0/?-policy). By reference to the definition of the function g in 
equation (|3.4p . a 'zero beta' (0(3) control policy h(t) is an admissible control policy 
for which the function g is independent from the state variable x. 

The term 'zero beta' is borrowed from financial economics (see for instance 
Black [5]). To avoid assuming the existence of a globally risk- free rate in factor models 
such as the CAPM, the APT or in ad-hoc valuation models, it is customary to build 
portfolios without any exposure to the factor(s) as a substitute for the risk-free rate. 
These special portfolios are referred to as 'zero beta' portfolios by reference to the slope 
coefficient /3 used to measure the sensitivity of asset returns to the valuation factor (s). 

In the risk sensitive asset management model, if Aq = 0, then the policy — 0, 
i.e. invest all the wealth in the risk-free asset, is a 0/3-policy. In the general case when 
^0 7^ 0, the set Z of 0/?-policies is the set of admissible policies h which satisfy the 
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equation 



Note that since m > n, there is potentially an infinite number of 0/3-policies as long 
as the following assumption is satisfied 

Assumption 3. The matrix A has rank n. 

Without loss of generality, in the following we will fix a 0/3 control /i as a constant 
function of time so that 



'lix,h;e) 



where 5 is a constant. 



5.2. The L'^{K) and ^''(/iT),! < rj < 00 Spaces. The following ideas and 
notations relate to the treatment of linear parabolic partial differential equations found 
in Ladyzhenskaya, Solonnikov and Uralceva |24| . The relevant results are summarized 
in Appendix E of Fleming and Rishel. They concern PDEs of the form 



1 

+ i^i*' x)D'^iIj) + b{t, xYDip + ec{t, x^j + d{t, x) = 
on a set Q = {0,T) x G and with boundary condition 



(5.1) 



tp{t,x) = 'i'rix) X e G 

ipit, x) ^ "ifit, x) (t, x) G (0, T) X dG 

The set G is open and is such that dG is a compact manifold of class C^. Denote by 

• d*Q the boundary of Q, i.e. 

d*Q ({T} X G) U ((0,T) x dG) 

• L^{K) the space of 77-th power integrable functions on K C Q; 

• the norm in L''(if). 

Also, denote by ^"^{Q), 1 < 77 < 00 the space of all functions ip such that tp and 
all its generalized partial derivatives are in L^{K). We associate with this space the 
Sobolev-type norm: 



I (2) 



dt 



ri,K 



E 

i=i 



I 9-0 
I dxi 



ri,K 



E 



9V 



ri,K 



(5.2) 



We will also introduce additional notation and concepts as required in the proofs. 



5.3. Existence of a Classical Solution. In this section, we use an approxima- 
tion in policy space to show the existence of a C^'^ solution to the RS HJB PDE (13. lip . 

Theorem 5.2 (Existence of a Classical Solution for the Exponentially Trans- 
formed Control Problem) . The RS HJB PDE (|3.15p with terminal condition l>(T, x) = 
e"*""^ has a solution $ G C^'^ ((0,T) x R") wtth $ continuous m [0,T] x M". 



Proof. Step 1: Approximation in policy space - bounded space 

Consider the following auxiliary problem: fix i? > and let be the open n- 
dimensional ball of radius i? > centered at defined as :— {x G M" : \x\ < R}. 
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We construct an investment portfolio by solving the optimal risk-sensitive asset al- 
location problem as long as X{t) G for R > 0. Then, as soon as X{t) ^ fJ^n, 
we switch all of the wealth into the 0/3 policy h from the exit time t until the end of 
the investment horizon at time T. The HJB PDE for this auxiliary problem can be 
expressed as 



a$ 1 

(5.3) 



-tr (aA' {t)D^^^ + H{t, X, !>, = \/{t, x) G Qr := (0, T) x 



where 



H{s, X, r, p) = mf^ {f{t,x,h)'p + eg{t, x, h)r} 



for p G M" and subject to boundary conditions 

l>(i, x) = ^{t, x) y{t, x) G d*QR := ((0, T) X d£§R) U ({T} x ^h) 

with 

• «'(r,x) = e-"^''" Vx G £i§R; 

• 'bit,x) := ip(t,x) := e'^siT-t) v(f,a;) G (0,T) x d^R and where h is a, fixed 
arbitrary 0(3 policy which is constant as a function of time. Note that is 

11,2/ 



obviously of class C ' {Qr) and that the Sobolev-type norm 
is finite. 



= (5-4) 



Define a sequence of functions l>^,... on Qr = [0,T] x ^r and of 

bounded measurable feedback control laws /i", h^,... h'^,... where /i° is an arbitrary 
control. $^^+1 solves the boundary value problem: 



+ 



, -tr (AA'(i)£)2$'=+i) + f{t, x, h^)'D^^+^ + eg{t, x, h'')^''+^ = 

(5.5) 

subject to boundary conditions 

l>(t, x) = *(t, x) Wit, x) G d*QR := ((0, T) x O^r) U ({T} x ^r) 
Moreover, for almost all {t, x) G Qr, A; = 1, 2, . . ., we define h'^ by the prescription 

= Argmin^e^ {/(t, x, h)' D^^ + %(t, a;, h)^} (5.6) 

so that 

f{t,x,h^)'D^'' + eg{t,x,h'')^^ = inf (/(i,.T,ft)'Dl>'= -h6'5(i,a;,/i)$'=| 

= H{t,x,¥,D¥) (5.7) 
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Observe that the sequence ( ) is globally bounded. Indeed, by Feynman- 

V / fcGN 

Kac, the sequence ( ) is bounded from below by 0. By the optimality principle, 

V / fcGN 

it is also bounded from above by e*-^* a{x(s),h;e)ds _ ^9g(T-t) ^ Moreover, these bounds 
do not depend on the radius R and are therefore valid over the entire space (0, T) x K". 



Note also that the boundary value problem (|5.5I) is a special case of the generic 
problem introduced earlier in equation (15.11) with 

a(t,x) = AA'{t) 
b{t,x)^f{t,x,h'') 
c{t,x) = g{t,x,h'') 
d{t,x) = 



Moreover, since ^Sr is bounded and J" is compact, all of these functions are also 
bounded. Thus, based on standard results on parabolic Partial Differential Equa- 
tions (see for example Appendix E in Fleming and Rishel [13] and Chapter IV in 
Ladyzhenskaya, Solonnikov and Uralceva i24j), the boundary value problem (|5.5p ad- 
mits a unique solution in ^^(Qn)- 

Step 2: Convergence Inside the Cylinder (0,r) x .^ij 

Step 2.1: Monotonicity of the Sequence 

Take k>\. Subtracting the PDE for ■1''+^ from the PDF for , we see that 



1 

-tr 
2 



\~di dT) 

+ (/(t, X, h'^)'D¥+^ - f{t, X, h'^-'YD^'^) + 6 X, h'')^''+' - g{t, x, h"-') 
in(0,r)xR" 



with $'=+1 - = 



Add and subtract /(t, x, h'')' D^'' + ^(t, x, /i'=)<I>'=, 



1 

-tr 
2 



(AA'(Or'^l>'^'+^) - (AA'(t)D2|.' 



\~di dT) 

+ (^f{t, X, h''yD^''+^ - f{t, X, h'^'^yD^''^ + e {g{t, x, h'')^''+^ ~ g{t, X, h''-^)^'' 

+ (/(t, X, h^yo^ + eg{t, X, h'')^''^ - (^f{t, X, h'^yo^'' + 9g{t, X, h'')^''^ 
in(0,r)xE" 
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Rearranging, 

+f{t, X, h^y [d^^+^ - + 6g{t, X, h^) - 

= in(0,r)xM" 



Define the function &{t^x) as 



t{t,x) := (/(i,x,/i'=)'i:>$'= + 6lg(i,x,/t'=)l''= 



By the definition of given in dSH), t{t, x) < V(t, x) £ [0, T] x M", V/c G N. Define 
the sequence of functions (W^'^)fegN as 



then W''^ satisfies the PDE 
BW^ 1 

+ -tr (AA'(t)L'2T^'=) + f{t, X, + 6l5(t, x, /i'=)W^'=^ + ^'^(t, x) = 

(5.8) 

in (0,r)x^ij, and with boundary condition VF'=(r,x) = Oon9*Qfl^ = ((0,r) x 9i^fl)U 
({T} X ^r). 

Define the stopping time tq as the first exit time from iiSn: 

TG ■■= inf : X{t) i G} 
By Lemma [5.31 below . W^(t^x^ can be represented by the expectation 

(■Thro 



W^{t,x) = E 



(5.9) 



Because £{t,x) < 0, W''{t,x) < for fc > 1 and hence by definition of W'', 

> ^fe+i^ Vfc e N 

which implies that the sequence < > 

I J ken 



IS non increasing. 



Step 2.2: Convergence of the Sequence 

Since the sequence ($'^)fceN is non increasing and is also bounded, it converges. Denote 
by $ its limit as fc — > oo. Now, since the Sobolev-type norm ll^'^^^ ll^j^g^ is bounded 
for 1 < ?7 < oo, we can apply the following estimate given by equation (E.9) in 
Appendix E of Fleming and Rishel 



1$ 



fc||(2) 



(5.10) 
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for some constant M/j (depending on R) and where 

n + 2 



i=l 



and 



\'^>%,^ sup \'!>'^{t,x)\+ sup 

0<t<T 
\^''{s,x) - ^''it,x)\ 

< < T 

to show that the Holder-type norm is bomrded. As k oo we conclude that 

• D^'^ converges to D<b uniformly in L^{Qr) ; 

• D^^'' converges to Z3^$ weakly in L^{Qii) ; and 

• converges to ^ weakly in L^[Qr). 

Step 2.3: Proving that $ G C^''^{Qr) 
Using estimate (IS.lOp . we see that l^'^lg^'^ is bounded for /i > 0, which implies that 
1] > n + 2. Using relationship (I5.7P and then equation (I5.5p . we get: 



1 



9i 2 



tr (AA'(t)L»2<|fe^ + /(t^ X, h)'D¥ + 6i.g(<, a;, h)^^ 



> _- + -tr(AA'(t)L'2$M +/(t,x,/i'=)'i:»$'^' + 6l.9(i,a;,/i'^)$'= 



AA'(t)i:>2$'=^) - (AA'(t)i:'2^'^+i 



dt 2 



-tr 



\ 9t dt I \2 
+f{t, X, h^y [d^'' - + 0g{t, X, h'') (^'^ - $'=+1) (5.11) 

for any admissible control h. 

Since the left-hand side of (I5.11|) tends weakly in L'^{Qr) to 

^ + itr (AA'(t)I?2<|) + /(i, X, /i)'i^l> -f 05(t, ft)$ (5.12) 

as fc — > oo and the right-hand side tends tends weakly to 0, then we obtain the 
following inequality 

— + -tr ( KK'{t)D^^ + f{t, X, hyD<t> + 0g{t, a;, /i)$ > 



dt 2 
almost everywhere in Qn 
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Using a measurable selection theorem and following an argument similar to that of 
Lemma VI. 6.1 of Fleming and Rishel [13], we see that there exists a Borel measurable 
function h* from (0,T) x into J such that 

/(i, X, h*)'D^ + eg{t, X, h*)^ = mf^ x, h)' + eg{t, x, 

holds for almost ah {t,x) £ (0,r) x Then 
a<|fe 1 



{KK'(t)D'^^''^ + f{t, X, h*)'D¥ + 0g{t, x, 
< ^ + itr (AA'(t)i:>2|>'^) + f{t, X, h'')'D¥ + Bg{t, x, h'')^ 



dt 2 
1 



dt 



dt 



-tr 
2 



AA'{t)D^^'') - AA'(t)L>2$ 



2^fe+l 



+/(t, /i'')' (^D** - D^'^+i j + 6ig(i, X, /i*) (^^'^ - 
Since the left-hand side of (|5.13p tends weakly in L^[Qh) to 

^tr (AA'{t)D'^^) + fit, x,h*yD^ + 0g{t,x,h*)^ 



(5.13) 



9t 2 

as fc — oo and the right-hand side tends weakly to 0, then we obtain the inequality 

— + -tr (^AA'(t)L>2$j + f{t, X, h*)'D<i> + eg{t, x, h*)<^ < (5.14) 
almost everywhere in Q^. 

Combining (|5.12p and (|5.14l) . we have shown that 

^ + itr (AA'{t)D^^) + fit, X, h*)'D^ + eg{t, x, h*)^ = 



dt 2 
almost everywhere in Qj^ 



Hence, $ is a solution of equation p. 151) on a bounded domain. Moreover, 
l> e C\Qr). Also, since H is locally Lipschitz, l^'^lg^ < oo for > and 
1^^ < oo for /X > 0, then |ff(i, a;, $'=,£>$'=) < oo. 

1:2/ 



We can now show that $ e Ci^2((5ii). Define 



Consider the following estimate given by equation (ElO) in Appendix E of Fleming 
and Rishel 



|l'|^t''<M2||^||Q'^ 
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(5.15) 



for some constant M2, and two open subsets Q' and Q" of Q such that Q' C Q"- In 
this estimate, set Q" — Qr and take Q' to be any subset of Q such that Q' C Q. 
Thus 

|l'||t'' < 00 (5.16) 

When interpreted in hght of estimate (|5.10p (stemming from (E9)), we see that the 
derivatives and ^ satisfy a uniform Holder condition on any compact 

subset Q' of Qr. By Theorem 10.1 in Chapter IV of Ladyzhenskaya, Solonnikov and 
Uralceva [23], we can therefore conclude that $ S C^''^{Qr)- 

Step 3: Convergence from the Cylinder [0,T) x .'^r to the State Space 

[o,r) X M" 

step 3.1: Setting 

Let > be a non decreasing sequence with limi_j.oo Ri ^ 00 and let {rij^gp^ 

be the sequence of stopping times defined as 

Ti := inf {i : X{t) ^ .^rJ 

Note that {rijjgpj is non decreasing and linii_>oo Ti — 00. 

Denote by the limit of the sequence on (0,r) x ^r., i.e. 

^^''>{t,x) = lim ^''{t,x) \/{t,x) £ (0,r) x 3§R, (5.17) 



Step 3.2: Convergence of the sequence ($'^*' ) 

V / ieN 

First, observe that the sequence (<&(*' )igN is non increasing. Indeed, for i < j the 
stochastic control problem defined over (0,T) x 3§r. is nested into the stochastic 
control problem defined over (0, T) x ^r - . In particular, a suboptimal strategy for 
the stochastic control problem defined over (0, T) x £§r^ would be to invest optimally 
while X G 3§R^ and then switch to the 0/3 policy h when x G SSr\SiSr^. By the 
optimality principle, the expected total cost of such strategy is greater than the value 
function ^^^^ . But this suboptimal strategy also corresponds to the optimal strategy 
for the stochastic control problem defined over (0, T) x !^r^ . Hence 

(t, x) > l>'^') {t, x) Vi , J G N, \f(t, x) e{0,T)x .^R^ 

By the argument in Step 1, the sequence ($('^)igN is also bounded. As a result, 
it converges to a limit This limit satisfies the boundary condition (|3.16p . We now 
show that $ is C^'^ and satisfies the HJB PDE. These statements are local properties 
so we can restrict ourselves to a finite ball Qr. 

Step 3.3: Proving that $ e C^'^{Qr) 
Using the following estimate given by equation (E8) in Appendix E of Fleming and 
Rishel 
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for some constant M, we deduce that 

ll^«ll^'L<M||*|||,%Q^ (5.18) 

which, combined with assumption (|5.4p . imphes that 11'^''^"'^^ |||,'^q^ is bounded for 77 > 1. 
Criticahy, the bound M does not depend on k. Moreover, by Step 2 and D^*^*' 
are uniformly bounded on any compact subset of Qo- By equation ()5.18p we know 
that ll'&ll,, is bounded for any bounded set Qr C Qo- 

On Qfl, l>'^*' also satisfies the Holder estimate 

||.W|^+/<Mi||$«||(2)^^ 

for some constant Mi depending on Qn and -q. Recalling that H{s,x,p) is locally 
Lipshitz and taking into account the estimate (jS.lSp (i.e. condition (ElO) in Appendix 
E of Fleming and Rishel [13] and also Theorem 10.1 in Chapter IV of Ladyzhenskaya, 
Solonnikov and Uralceva [23]), we find, that ^^j— and gj^^ ' also satisfy a uniform 
Holder condition on any compact subset of Q. 

By Ascoli's theorem, we can find a subsequence I ) of ( ) such that 

V / leN \ / ieN 

(i) . ( ^'^ J tends to a limit $ uniformly on each compact subset of Qq ; 
v / leN 

^ j tends to a limit ^ uniformly on each compact subset of Qq; 

(iii) . ( I tends to a limit £)$ uniformly on each compact subset of Qq; 

\ I im 

(iv) . ( ) tends to a limit uniformly on each compact subset of Qo- 

Finally, the function $ is the desired solution of equation (|3.15p with terminal 
condition $(T,x) = e"*'"" 
□ 

We used the following Lemma in Step 2.1 of the proof of Theorem 15.21 

Lemma 5.3. // the junction u{t, x) satisfies the PDE 
du 1 

— + -tr{AA'{t,x)D^u) + f{t,xYDu + eg{t,x)u + iit,x) = (5.19) 

in Q := (0,T) x G,G C M", and subject to boundary conditions 

$(i, x) = *(t, x) V(t, x)ed*Q ((0, T) x dG) U ({T} x G) 

then 



■T/\TG 



u{t, x) — 'E 

L 

(5.20) 

where 
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• The n-dimensional diffusion process X(t) satisfies the SDE 

dX{t) = f{t, X(t))dt + A{t)dW{t), X{0) = X 

where W{t) is a n-dimensional Brownian motion 

• the stopping time tq is defined as 

TG :=inf{i : X{t) i G} 

Proof. Take a non-decreasing sequence of stopping times {rjl^.^pj. Define Ur 
u{r,Xr) and Zj := 9 g{s, Xs)ds. By the Ito product rule, 



TjArcAT 



tjAtgAT 



u{t, x) + 



9g{s,x)Use^'ds + 
A{s,Xs)e^<-'UW{s)- 



£{s,Xs)e^'ds 



ds 



AUs e^'ds 



TjAtgAT 



£{s,Xs)e^'ds 



where Af is the generator of the function F{t,Xt), defined as 

AF{t, x) := f{t, xyDF{t, x) + ^tr {AA'{t, x)D^F{t, x)) 
Rearranging and taking the expectation on both sides of (|5.2ip . we get 

pTjAToAT 



(5.21) 



E 



UTjATcATe ^ ^ + 



i{s,Xs)e^'-ds 



u{t,x) + E 



' —+AUs+ eg{s, x)Us + €(s, Xs) ) e^^ds 



Taking into account relation (|5.19p . this equation simplifies to 

fTjAToAT 



u(t, a;) = E 



(5.22) 



(5.23) 



Letting j -> cxd, equation (|5.23p converges pointwise to 

rTATG 



u{t,x) = E 



e{s,Xs)e^^^'s{r,X^)dr^^ 

(5.24) 



Corollary 5.4 (Existence of a Classical Solution for the Risk-Sensitive Control 
Problem). The RS HJB PDF p.lip with terminal condition $(T, x) = Inw has a 
solution $ e (7^-2 ([0,T] x R") with $ continuous in [0,T] x R". 
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6. Summary of Main Results. We now conclude by deducing that $ is 
the unique classical solution of the HJB PDE (|3.11[) subject to terminal condition 
$(T, x) = Inu and that a similar result holds for the exponentially transformed value 
function $. We will also show that the value function $ is convex. 



Theorem 6.1. 

Choosing as optimal control the unique maximizer of the supremum p.l9|) . $ is 
the unique C^'^ ([0,T] x M") n C([0,r] x R") solution of the RS HJB PDE (jXTS]) - 
p.l6p . Moreover, $ satisfies the property (j6.2p . 

Proof By Corollary EH $ is a C^'^ ([0,r] x R") n C([0,T] x M") solution of 
the RS HJB PDE (|3.15l) - (|3.16p . Moreover, the existence of zero beta policies enable 
us to deduce (as in Step 1 of the proof of Theorem 15. 2p that $ is bounded. Part 
(i). of Theorem 14.11 therefore applies to Choosing as optimal control the unique 
maximizer of the supremum p.l9|) . part (ii). of Theorem 14.11 also applies: $ is the 
unique solution to the HJB PDE. Property (|6.2p is proved in Corollarv l6.3l □ 

This result proves that we have solved our original control problem in the context 
of strong, classical solutions. What would this imply in terms of weaker viscosity 
solutions? As a classical solution is also a viscosity solution, our result implies that 
the value function is indeed a viscosity solution of the HJB PDE. However, uniqueness 
of classical solutions does not necessarily imply uniqueness of viscosity solutions. To 
prove uniqueness in the viscosity sense, we would need a comparison result such as 
Theorem 33 in Davis and Lleo jll| . 



Proposition 6.2. The value function is convex in x. 

Proof. 

To prove that the value function x) is convex in x, it is necessary and sufficient 
to show that V(2;i,X2) G R" and for any k, e (0, 1), 



$(t, KXi + (1 - n)x2) < K$(t, xi) + (1 - X2) 
Start from the left hand side: 



(6.1) 



$(i, KXi + (1 - k)x2) 

1 



sup 

heA(T) 



■In E. 



t,KXl-\-{l — K,)x2 



1 



sup --ln'Et,{xi, 

heA{T) t/ 



1 



sup 

h£A{T) 



■InE. 



t,{xi,X2) 



exp 



exp < K 




giXs,his);9)ds^e\nv\x(t) 



g{KXi{s) + (1 - K)X2{s),h{s);e)ds -d\nv} x{t) 



g{Xi{s),h{s);9)ds 



(1 - ^)dj^ g{X2{s),h{s)-e)ds -e\nv\x{t) 



= sup -ilnEt^(^j_^2) 
heA{T) t/ 



exple / g{Xi{s),h{s);9)ds ^ 9\nv \ x{t) 



X exp ■ 



g{X2is),his);9)ds-e\nv}xit) 
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< sup --In-^ Et,:ri 
hGA{T) f I 



exple / giXi{s),h{s)-0)ds-einv}x{t) 



xEt, 



exp<^6' / g{X2is),h(s);e)ds - 0\nv} x{t) 



1 



= sup <--lnEi_^^ 

heA{T) I 



exple g{Xi{s),h{s);9)ds-e\nv}x{t) 



t,X2 



exple J g{X2is),h{s);9)ds ~9\nv^x(t) 



< sup --lTi'E,t,xi 

heAiT) tl 



■ sup --InEt.a;^ 



expie / g{Xi{s),h{s);9)ds ~ e\nv } xit) 



exple / giX2{s),h{s)-0)ds - 0lnv} x{t) 



< sup --InEt.xi 

heAiT) ^ 



exple I g{Xi{s),h{s)-e)ds - e\TLv) x{t) 



1 — K 

■ sup — In E 

h£A{T) ^ 



t,X2 



exp<{6' / g{X2{s),h{s)-e)ds ^ e\nv\x{t) 



+ (1 - K)$(t, 2:2) 



where 



• the fifth fine fofiows from the fact that the covariance of two random variables 
inside the expectations is positive; 

• the eighth fine is due to the fact that the function x H> a;" for a: > and 
a £ (0, 1) is concave. 



Corollary 6.3. The exponentially transformed value function $ has the follow- 
ing property: V(a;i, X2) G M^, k G (0, 1, ), 



KXi + (1 - k)x2) > ^""{t, X2) 



3.2) 



Proof. The properties fofiows immediately from the definition of ^{t,x) — — ^ In x). 



□ 



The following corollary applies to the risk sensitive value function 
Corollary 6.4. 

Choosing as optimal control the unique maximizer of the supremum p.l9p , $ is 
the unique C^'^ {[0,T] x M") n C([0,T] x M") solution of the RS HJB PDF (jXTT]) - 
p.l3p . Moreover, $ is convex in its second argument x. 

Note that the approach presented in this article extends naturally to a jump- 
diffusion version of the risk-sensitive benchmarked asset management problem intro- 
duced by Davis and Lleo [10, and would yield similar results, namely the existence of 
a unique admissible control policy and of a classical C"^'^ solution to the associated 
RS HJB PDE. 
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7. Partial Observation. In this section we show how the resuhs of the paper 
can be extended to the case where the factor process X{t) is not directly observed and 
the asset allocation strategy ht must be adapted to the filtration = a{Si{u), < 
u < t,j = 0, . . . , m} generated by the asset price processes alone. In the linear 
diffusion case studied by Nagai p7 and Nagai and Peng [28], the authors noted 
that the pair of processes {X{t),Y{t)), where Yi{t) = logS'i(t), take the form of the 
'signal' and 'observation' processes in a Kalman filter system, and consequently the 
conditional distribution of X{t) is normal N{X{t),P{t)) where X{t) = E[X(t)|jf] 
satisfies the Kalman filter equation and P{t) is a deterministic matrix-valued function. 
By using this idea they obtain an equivalent form of the problem in which X{t) is 
replaced by X{t) and the dynamic equation (|2.2I) by the Kalman filter. Optimal 
strategies take the form h{t, X{t)). This is in fact a very old idea in stochastic control, 
going back at least to Wonham |35| . 

7.1. Decomposition. At first sight it does not seem apparent that the same 
approach can be used here, as the price processes contain jumps, but a simple ob- 
servation shows that the jumps play no role in the estimation process, which is still, 
at base, the Kalman filter; see Proposition 17. II below. A further complication is that 
the money market interest rate r{t) — qq + AQX{t) (see p.3p ) is observed directly 
and contains information about X{t). This was not the case in and |2H] where, 
in our notation, Aq = 0. We start by assuming that Aq = 0, and briefly discuss the 
extension to 7^ at the end of the section. 

Recall first that X{t) satisfies 

dX{t) = {b + BX{t))dt + AdW{t), X{0)^Xo (7.1) 

When Xt is observed, the initial value Xq is just a constant. In the present case we 
need to assume that Xq is a normal random vector N{'mo^ Pq) with known mean toq 
and covariance Pq, and that Xq is independent of the processes W, Np. 

An application of the general Ito formulelfl shows that for i = 1, . . . , m the log- 
prices Yi{t) satisfy Yi{0) = logs^ and 



dY,{t) 



{a + AX{t)),--j:^ 



1 ^ 



dt + J2'^^kdWk{t) 



k=l 

+ [ {ln(l+7,(z)) -7,(z)}iy(dz)dt+ / \n{l + -f,{z)) Np{dt,dz). (7.2) 
Proposition 7.1. Define processes Y^{t),Y'^{t) e M™ as follows. 

dY\t) = Ax{t) + i:dw{t), y/(o) = o, (7.3) 

dY^{t) = c,dt + J \n{l+ -f{z))^Np{dt,dz), i = l,...,m, Y^{0) = \ogs, 
with c e M™ defined by 

c, := a, - iEE',, + f {In (1 + 7,(z)) - 7,(2)} z.(dz) 

Zo 

so that Y{t) = Y\t) + y2(t). Also, define = a{Y'{u),0 <u<t},i^l,2. Then 



^See 0ksendal and Sulem |22| for this calculation. 
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(i) The processes Y^,Y'^ are each adapted to the filtration J-f 

(ii) For any bounded measurable function f and t > 0, 

E[/(X(t))|J-f]=E[/(X(i))lyH]. 



Proof, (i) S{t) and Y{t) arc in 1-1 correspondence and therefore generate the 
same filtration J^f . Apart from rearrangement of deterministic terms, the decompo- 
sition Y = + is the same as the standard decomposition Y ~ Y'^ + Y'^ of a 
semimartingale into its continuous and discontinuous components, see paragraph VI. 
37 of Rogers and Wilhams [3T1 . 

(ii) A^p and W(t) are independent and as a result yu and 3^2t are independent, and 
clearly — V 3^2*- The result follows, since X{t) is independent of 3^2* ■ D 

7.2. Kalman Filter. The processes {X{t),Y^{t)) satisfying ([711 and (TO)) and 

the filtering equations, which are standard, are stated in the following proposition. 

Proposition 7.2 (Kalman Filter). The conditional distribution of X{t) given 
yit is N{X(t), P(t)), calculated as follows. 

(i) The innovations process U{t) G R™ defined by 

dU{t) = {Y.Y.')-^^^{dY\t) - Ax{t)dt), U{0) = (7.4) 

is a vector Brownian motion. 

(ii) X{t) is the unique solution of the SDE 

dX{t) = {b + BX{t))dt + (as' + F(t)i') (EE')"^/^dJ7(<), A(0) = mo. (7.5) 

(Hi) P{t) is the unique non-negative definite symmetric solution of the matrix Riccati 
equation 



P{t) = AEE'A' - P{t)A'{J:j:')-^AP{t) +i^B- AS'(EE')"^^j Pit) 

+P{t) {b' - i'(EE')"^SA') , ^'(0) = ^0 

where 5 := / - S' (E'E)"^ S. 

To conclude, the Kalman filter has replaced our initial state process X{t) by an 
estimate X{t) with dynamics given in (j7.5|) . To recover the asset price process, we 
use (|7.3p together with (|7.4I) to obtain the dynamics of Y{t): 

dY,{t) = dY^\t)+dY^{t) 

= a, + AX{t)dt - ^EE-,,dt -I- (EE'^/^dUit) 



+ / {lnil+Jdz))-l^{z)}ly{dz)+ / ln(l+7(z)),Ap(dt,dz). (7.6) 

JZo Jz 

We then apply Ito to Si{t) — expYi{t) to get 

- ia + AXit))dt + y] [(EE')^/'l dUkit)+ [ j,iz)Np{dt,dz), 

Si{Q) = Si, i = 1,.. .,m (7.7) 
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We now solve the stochastic control problem with partial observation simply by 
replacing the original asset price description (j2.4p by (j7.7p . and the factor process 
description (|2.2p by the Kalman filter equation (I7.5|) . in our solution of full observation 
case. The Kalman filter has time-varying coefficients, but this does not affect the 
preceding arguments. 

Finally, we briefly sketch what to do if 0. We observe the short rate 

r{t) = ao + A'QX{t), and hence the f -dimensional statistic lo(t) = AQX{t), exactly. 
We need to assume that this observation contains positive 'noise', i.e. A'qAA'Ao > 
0. Changing coordinates if necessary, we can assume that A'q — (0,0, ...,f) and 
hence Yo{t) = Xn{t). Our 'observation' is now the (m -I- l)-dimensional process 
Y = (Yq, . . . , Ym) and we can set up a Kalman filter system to estimate the unobserved 
states X — {Xi, . . . , Xn-iY £ M"^^. Ultimately, our optimal strategy will take 
the form h{t, Xn{t), X{t)), where X{t) is the Kalman filter estimate for X{t) given 
{Y{u),u < t}. The details are left to the reader. 

8. Conclusion. In this article, we extended the classical risk-sensitive asset 
management setting to include the possibility of infinite activity jumps in asset prices. 
We applied the change of measure technique proposed by Kuroda and Nagai [23] to 
derive the Hamilton- Jacobi-Bellman Partial Differential Equation associated with the 
control problem and then proved the existence and uniqueness of an admissible opti- 
mal control policy. Using an approximation in policy space algorithm, we established 
the existence of a classical C^'^ ((0,r) x M") solution and obtained the uniqueness 
of this solution through a verification result. This approach also extends naturally 
and with similar results to a jump-diffusion version of the risk-sensitive benchmarked 
asset management problem. 

Finally, we have observed that an attractive, if somewhat surprising, feature of 
the jump diffusion risk sensitive asset management is that it naturally prohibits any 
investment policy which may result in the investor's bankruptcy. In particular, in 
the risk-sensitive setting presented in this article, an investor who implements the 
optimal asset allocation is certain of remaining solvent over the investment horizon. 
This contrasts with the Merton type of approach in which the threat of bankruptcy 
remains present and has to be accounted for using a stopping time. 

Appendix A. Admissibility of the Optimal Control Policy. The admis- 
sibility of the optimal control process h*{t) solving (|3.19p is linked to the existence 
of a probability measure P^, , which itself hinges on the characterisation as an ex- 

ponential martingale of the Radon-Nikodym derivative = Xt defined in IjS.Sp 

via the Doleans exponential introduced in (|2.8I) . In the setting of Kuroda and Na- 
gai [231 , the admissibility of the control follows easily from an argument in Gihman 
and Skhorokhod [18^ which proves that the the Doleans exponential (here a Girsanov 
exponential with Gaussian integrand) is an exponential martingale. However, when 
the Doleans exponential does not have continuous path, as is the case in a jump dif- 
fusion setting, proving that it is indeed a martingale is more difficult. As noted by 
Protter [3^, some partial results exist in this case (see for example Memin [Ml and 
more recently Protter and Shimbo [29]), but none is as powerful as their counterparts 
in the continuous case, namely the Kamazaki or the Novikov conditions. 

To show that the Doleans exponential introduced in (|2.8p is a martingale we will 
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apply results derived by Memiii [26 . We recall here the definition of the Doleans- 
Dade exponential as well as results from Memin (see also Exercise 13 in Chapter 
V of [30]) on the multiplicative decomposition of local martingales that we will use to 
prove our point. 

Definition A.l (Doleans-Dade exponential). The Doleans-Dade exponential 
£{X){t) of a semimartingale X (t) is defined as 

5(X)(^) = cxp|x(^)-i[x^x^]A n (i + AX,)e-^^= (A.l) 

^ 0<s<t 

Definition A. 2 (Memin's Additive Decomposition of Local Martingales). 
Let M(t) be a local martingale. We define an additive decomposition of M into 
two processes Mi{t) and M2{t), i.e. such that M{t) = Mi{t) + Ahit). 

In this decomposition, the process Mi{t) is defined as Mi{t) ~ L{t) — L{t) where 

0<s<t 

and L{t) is the compensator of L{t). 

Proposition A. 3 (Memin's Proposition III-l). Let M{t) he a local martingale 
with additive decomposition as per definition \A.2\ and such that Mq = 0. Then 
(i) £{M) has the decomposition 

£{M) = £{M2)£{Mi) 



where 



Mi{t)^MAt)~ > --^ t<oo 

^' ^' ^ l + AAf2(s) 



(ii) £{M2)Mi is a local martingale. 

(Hi) If AM{s) > -1 then AMi(s) > -1 for all finite s. 

Corollary A. 4 (Memin's CoroUary III-2). Let N be a local martingale such 
that AN{s) > —1 for all finite s, and such that £{N{oo) is uniformly integrable. Let 
P' be the probability defined as 

dF' 

^ =£(A)(oo) 
dF ^ ^ 

Let Ni be a local martingale with locally integrable variations and denote by Ni 
the F -semimartingale defined as 



0<s<t 



/ ^ ,r . ^ AAi(s)AA(s) 
mit) ^ N,it) ^ 1 + AA(.) ' 



then Ni is a F' local martingale, with locally integrable variations. Moreover, the F' 
predictable compensator o/^q^^^^ |AAi(s)| is equal to the F predictable compensator 
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o/Eo<s<JAiVi(s)|. 

Theorem A. 5 (Memin's Theorem III-3). Let M{t) he a local martingale with 
additive decomposition as per definition \A.'A If the predictable compensator of the 
process 

Y{t)^[M^,M%+ \^M,{s)\+ {AM,{syf (A.2) 

0<s<t 0<s<t 

is bounded, then £{M){t) is uniformly integrahle. 

Proof of Proposition \4^.S\ To prove that the control h*(t) is admissible, we need 
to show that the local martingale M*{t) defined as 

M*{t):^~0 f h*{s)'Y.dWs- j f In {I- G{z,h*{s); 9)) Npids,dz) (A.3) 
Jo Jo Jz 

and such that 

mm = x: 

is an exponential martingale. 

To achieve this objective, we will define a new class of control processes to which 
the optimal control belongs. We will start from the definition of a control h as a. 
function: 



h : [0,T] X K" ^ J 

(t, x) H> h(t, x) 



where the set J' was defined in (|2.6[) . Based on this definition, the control space can 
be viewed as a functional space. 

Define the functional C{x,p, h) as 

C{x,p, h) := {9 + 1) /I'EE'/i - 9h'Y,k'p + h'{a + Ax) 

-\ J {[{l + h'jiz))-' -l\+9h'jiz)lz„iz)}i^idz) 

(A.4) 

where p G M" so that 

sup L'l^ = {b + Bx)' D4> + -tr (AA'D^^) _ -(£)$)' AA'D$ + oq + A'^x 



+ sup£{x,D^,h) 
heJ 



(A.5) 



and the unique maximizer of h*{t,x), is also the unique maximizer of 

C{x,D<i>,h). 
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Observe that with the choice of control function h°{t, x) :— V(t, x) e [0, T] x R", 
the functional C{x,p,h") — V{t,x,p) e [0,T] x M" x M". Invoking the optimality 
principle, we deduce that C{x, D^, h* {t, x)) > 0. 

Denote by J the range of the control functions h{t,x) such that £{x,p,h) > 
0. Under Assumption [l] the set J', defined by (|2.6p . is in the interior of a hy- 
percube and since the functional C{x,p,h) is smooth, strictly concave in h and 
limfi^Qj C{x,p,h) = — oo, we deduce that the set JT" is a closed convex subset of 
J for all (t, x) G [0, T] x R". The control functions h take the form 

{t,x) h{t,x) 

More formally, we can define a class 'H{T) of Markov control processes as 

Definition A. 6. A control process h(t) is in class 'H{T) if the following condi- 
tions are satisfied: 

1. h{t) is in class % introduced in Definition \2.2^ 

2. hit,x) e J \/it,x) G [0,T] X R". 

In particular, we note that the optimal control process h*{t) G H{T) Vt G [0,r] 
and Voj G fl. 

For any control policy h{t) G 'H{T), define the local martingale M{t) as 

M{t):=-eJ h{syY.dWs- J \n(l-G{z,h{s);9fj Np{ds,dz) (A.6) 
Also, let L{t) be the process defined as 

0<s<t 

= -/ f In (l - G{z,h{s);0)\ Np{ds,dz) 

■^0 Z\Zi 

where Zi = {z G Z : |Ai;| < ^,0 < s < t}. Then, the process Mi(t) := L{t) - L{t) 
can be expressed as: 

Mi{t) = - I I \n(l -G{z,h{s)]Q))Np{ds,dz) 

To complete our decomposition of the local martingale M{t), we define the process 
M2(t) as 

M2it) = M{t) - Mi{t) 

= -[ I Infl -G{z,h{s); 9)) Np{ds,dz) 

The next step is to study each component of the process Y{t) defined in (jA.2[) : 
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The process 



[M", M% = exp |6i2 y h{s)'i:T.'h{s)ds 

is clearly bounded because h{s) G 7^(r) for all s G [0,^]; 
The process 



^ |AMi(.)| = /* / 
0<s<t -^0 -^zxzi 



In (1 - G(z,/i(s);6i)) iVp(ds,dz) 



is bounded because h{s) £ %{T) for all s G [0,t]. In addition, the number of 
jumps greater than -i is finite: 

# {0 < s < t; |AMi(s)|} = # {O < s < t; |AM(s)| 

1 



= TV l^i, -oo, 
< oo 

• Finally, we turn our attention to the process 

0<s<t "^0 "'^1 



U 



\TL[l-G{z,h{s)-e)] Np{ds,dz) 



Recalling that we assumed that in our setting 

\'y{z)\'^iy{dz) < oo 



and taking into consideration the fact that h{s) G H{T) for all s G [0, t], then 
we deduce that 



In (^1 - G(z,/i(s);6l)j v{dz) < oo 
for any w G 11, which proves that 

\n(l -G{z,h{s);9)j ^Np{ds,dz) 



JZi 



< oo 



By Theorem IA.5[ the Doleans-Dade exponential 

mm - x*t 

is uniformly integrable for all h G i-L{T). We can now apply Corollary I A.4I to formally 
define the measure P~. In particular, the measure P^, characterized via the Radon- 

Nikodym derivative Xt ^^^^ defined because h*{t) G i-L{T) Vui G fi. This proves 

that the control h*{t) is admissible for all t E [0,T] and cj G 11. 

□ 

Note that the control policy = corresponds to investing the entire wealth 
into the money market asset for the duration of the investment period. The associ- 
ated measure P^o is well defined and it is equal to the physical measure P. In fact, 
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this proof not only shows that the optimal control process h*{t) is admissible, but 
also that a large class of "reasonable" control processes h{t) is also admissible and is 
associated with a well-defined probability measure. 



Proof of Proposition \4.4\ Consider the exponentially transformed problem inf Jix, t, h; ( 

where 



J{x,t,h;0) := InE 



-einV{t,x,h) 



(A.7) 



Note that because the term e sinV{t,x,h) bounded from below by 0, inf/,g_4('r') J{x, t, h; 
is well defined which implies that there exists at least one minimizer h. 



E 



-B\nV{t,x,h) 



E: 



h,e 



(see for example Lemma 8.6.2. in [3T]) and hence 



inf E 

heA(T) 



nnV{t,x,h) 



inf e: 



exp < 6* / g{s,Xs,h{s);9)ds — d\nv 



g{s, Xg, h{s); 9)ds ~ d\nv 



h,e 



exp ■ 



heA{T) 

I{v,x;h*{t);t,T; 



which proves that the optimal control h*{t) for the auxiliary problem sup^g_4(-y) I{v, x; h; t, T; 
derived in Section [3. 31 is indeed optimal for the problem sup/jg^cj^-i J{x, t, h; 0). 
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